Likelihood Probability Mismatch Analysis and Normalization in Multilingual Speech Applications

نویسندگان

  • Bin Ma
  • Cuntai Guan
  • Haizhou Li
چکیده

In this paper, with a multilingual speech recognition system, we exam the HMM likelihood scores among the different acoustic models and observe that there exist scoring mismatches. The mismatches might come from different recording environments in which the training data for each language were collected, or come from different acoustic modeling structures. This analysis helps us understand the gaps among the likelihood probabilities on these acoustic models. Based on the observation of the differences of likelihood probability scores from different languages, we study a simple frame based likelihood probability normalization method to balance the likelihood scores of multiple acoustic models in the recognition system. Experiments show that this normalization method is effective to compensate the likelihood probability biases that come from different training corpora and different acoustic structures.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Methods in Multilingual Speech Recognition

One key issue in developing learning methods for multilingual acoustic modeling in large vocabulary automatic speech recognition (ASR) applications is to maximize the benefit of boosting the acoustic training data from multiple source languages while minimizing the negative effects of data impurity arising from language “mismatch”. In this paper, we introduce two learning methods, semiautomatic...

متن کامل

Robust Speech Features and Acoustic Models for Speech Recognition

This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) systems against noise distortions. The study is important as the performance of ASR systems degrades dramatically in adverse environments, and hence greatly limits the speech recognition application deployment in realistic environments. Towards this end, we examine a feature compensation approach and...

متن کامل

Normalization in the acoustic feature space for improved speech recognition

In this work, normalization techniques in the acoustic feature space are studied which improve the robustness of automatic speech recognition systems. It is shown that there is a fundamental mismatch between training and test data which causes degraded recognition performance. Adaptation and normalization, basic strategies to reduce the mismatch, are introduced and placed into the framework of ...

متن کامل

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

A minimax classification approach with application to robust speech recognition

AbstructA minimax approach for robust classification of parametric information sources is studied and applied to isolatedword speech recognition based on hidden Markov modeling. The goal is to reduce the sensitivity of speech recognition systems to a possible mismatch between the training and testing conditions. To this end, a generalized likelihood ratio test is developed and shown to be optim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002